Discriminative Binaural Sound Localization
Authors
Abstract
Time difference of arrival (TDOA) is commonly used to estimate the azimuth of a sound source with a microphone array. The most common TDOA estimators locate extrema in generalized cross-correlation waveforms. In this paper we apply microphone array techniques to a manikin head. By considering the entire cross-correlation waveform, we achieve azimuth prediction accuracy that exceeds that of extrema-locating methods. We do so by quantizing the azimuthal angle and treating the prediction problem as a multiclass categorization task. We demonstrate the merits of our approach by evaluating the various approaches on Sony’s AIBO robot.
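The two ingredients named in the abstract, a generalized cross-correlation waveform and a quantized azimuth label, can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the paper's method: the PHAT weighting, the function names `gcc_phat` and `azimuth_to_class`, the lag window `max_lag`, and the choice of 7 equal-width classes over -90 to 90 degrees are all hypothetical. An extrema-locating baseline would keep only the returned peak lag, whereas the approach described above would pass the whole windowed waveform `cc` to a multiclass classifier.

```python
import numpy as np


def gcc_phat(left, right, fs, max_lag):
    """Generalized cross-correlation of two channels (PHAT weighting assumed).

    Returns (tdoa_seconds, lags_seconds, cc), where cc covers lags
    -max_lag..+max_lag in samples. A positive TDOA means `left` lags `right`.
    """
    n = len(left) + len(right)            # zero-pad to avoid circular wrap-around
    spec_l = np.fft.rfft(left, n=n)
    spec_r = np.fft.rfft(right, n=n)
    cross = spec_l * np.conj(spec_r)      # cross-power spectrum
    cross /= np.abs(cross) + 1e-12        # PHAT: keep phase, discard magnitude
    cc_full = np.fft.irfft(cross, n=n)
    # Reorder so the window runs from lag -max_lag to lag +max_lag.
    cc = np.concatenate((cc_full[-max_lag:], cc_full[:max_lag + 1]))
    lags = np.arange(-max_lag, max_lag + 1) / fs
    # Extrema-locating baseline: take the lag of the largest peak.
    return lags[np.argmax(cc)], lags, cc


def azimuth_to_class(azimuth_deg, n_classes=7, lo=-90.0, hi=90.0):
    """Quantize an azimuth in degrees into one of n_classes equal-width bins."""
    edges = np.linspace(lo, hi, n_classes + 1)
    idx = np.searchsorted(edges, azimuth_deg, side="right") - 1
    return int(np.clip(idx, 0, n_classes - 1))
```

In a classification setup along these lines, each training example would pair the vector `cc` (the feature) with `azimuth_to_class(true_azimuth)` (the label); the classifier itself is left open here because the abstract does not name one.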
Similar resources
Estimating the azimuth of a sound source from the binaural spectral amplitude
A computational model of auditory localization based on processing spectral amplitude cues for localizing broadband high frequency sound sources is presented. The cues extracted are binaural spectral level difference patterns of the Head Related Transfer Functions (HRTF) corresponding to the direction of the sound source. Four different pattern classifiers are used to evaluate the feasibility of l...
Testing the Use of the Binaural Cross-Correlation Coefficient in Azimuthal Sound Localization
Azimuthal sound localization studies were performed on 4 listeners for sounds recorded in two different rooms. The sounds had multiple values of binaural coherence ranging from 0.2 to 0.8 in each room. The sounds were presented to the listener, followed by the same sound with a slight delay in either ear to create an interaural time difference. Previous studies performed with synthetically corr...
Chapter 5 Binaural Sound Localization
We listen to speech (as well as to other sounds) with two ears, and it is quite remarkable how well we can separate and selectively attend to individual sound sources in a cluttered acoustical environment. In fact, the familiar term ‘cocktail party processing’ was coined in an early study of how the binaural system enables us to selectively attend to individual conversations when many are prese...
Integrating Monaural and Binaural Cues for Sound Localization and Segregation in Reverberant Environments
The problem of segregating a sound source of interest from an acoustic background has been extensively studied due to applications in hearing prostheses, robust speech/speaker recognition and audio information retrieval. Computational auditory scene analysis (CASA) approaches the segregation problem by utilizing grouping cues involved in the perceptual organization of sound by human listeners. ...
Binaural localization cues
12 Binaural localization cues
13 Sound location: monaural cues and spectral cues for elevation
14 Physiological correlates of the precedence effect and binaural masking level differences